A family of regression methods derived from standard PLSR

نویسندگان

  • Jean-Claude Boulet
  • Dominique Bertrand
  • Gérard Mazerolles
  • Robert Sabatier
  • Jean-Michel Roger
چکیده

We present a new regression method derived from standard PLSR which has a geometric point of view and consists of two projections. In the first, scores are obtained after an oblique projection of the spectra onto the loadings. In the second, the vector of response values is projected orthogonally onto the scores. A metric is introduced for the oblique projection and a new algorithm for calculating the loadings into the variable space is proposed. This work also puts forward a new parameter, a vector, whose different values lead to different regression models with their own prediction abilities, and one of them is the exact form of standard PLSR. This method (called vector orientation decided through knowledge assessment, or VODKA regression) is another way to build least squares regressions using only a few latent variables. We propose two Email addresses: [email protected] (Jean-Claude Boulet ), [email protected] (Dominique Bertrand ), [email protected] (Gérard Mazerolles ), [email protected] (Robert Sabatier), [email protected] (Jean-Michel Roger) 1corresponding author 2present address: Data Frame, 25 rue Stendhal, F-44300 Nantes, France Preprint submitted to Chemolab May 29, 2012 Author-produced version of the article published in Chemometrics and Intelligent Laboratory Systems, 2013, 120, 116-125. The original publication is available at http://www.sciencedirect.com DOI : 10.1016/j.chemolab.2012.11.002 ha l-0 07 80 07 6, v er si on 1 23 J an 2 01 3 Author manuscript, published in ""

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PLS-regression: a basic tool of chemometrics

Ž . Ž PLS-regression PLSR is the PLS approach in its simplest, and in chemistry and technology, most used form two-block . predictive PLS . PLSR is a method for relating two data matrices, X and Y, by a linear multivariate model, but goes beyond traditional regression in that it models also the structure of X and Y. PLSR derives its usefulness from its ability to analyze data with many, noisy, ...

متن کامل

Modeling and Mapping of Soil Salinity with Reflectance Spectroscopy and Landsat Data Using Two Quantitative Methods (PLSR and MARS)

The monitoring of soil salinity levels is necessary for the prevention and mitigation of land degradation in arid environments. To assess the potential of remote sensing in estimating and mapping soil salinity in the El-Tina Plain, Sinai, Egypt, two predictive models were constructed based on the measured soil electrical conductivity (ECe) and laboratory soil reflectance spectra resampled to La...

متن کامل

Spectroscopic Determination of Aboveground Biomass in Grasslands Using Spectral Transformations, Support Vector Machine and Partial Least Squares Regression

Aboveground biomass (AGB) is one of the strategic biophysical variables of interest in vegetation studies. The main objective of this study was to evaluate the Support Vector Machine (SVM) and Partial Least Squares Regression (PLSR) for estimating the AGB of grasslands from field spectrometer data and to find out which data pre-processing approach was the most suitable. The most accurate model ...

متن کامل

Prediction of Biomass Production and Nutrient Uptake in Land Application Using Partial Least Squares Regression Analysis

Partial Least Squares Regression (PLSR) can integrate a great number of variables and overcome collinearity problems, a fact that makes it suitable for intensive agronomical practices such as land application. In the present study a PLSR model was developed to predict important management goals, including biomass production and nutrient recovery (i.e., nitrogen and phosphorus), associated with ...

متن کامل

Predicting Grassland Leaf Area Index in the Meadow Steppes of Northern China: A Comparative Study of Regression Approaches and Hybrid Geostatistical Methods

Leaf area index (LAI) is a key parameter used to describe vegetation structures and is widely used in ecosystem biophysical process and vegetation productivity models. Many algorithms have been developed for the estimation of LAI based on remote sensing images. Our goal was to produce accurate and timely predictions of grassland LAI for the meadow steppes of northern China. Here, we compare the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013